3 Common Distributions

1 Distribution Derived from 0-1 Sequence

Consider a sequence of 0s and 1s (we will call this a Bernoulli process). Denote the value at index $i$ by a RV $X_i$, $i = 1, \dots, n$.

1.1 Bernoulli Distribution

If $X \sim \mathrm{Bernoulli}(p)$, then $P(X=1)=p$, $P(X=0)=1-p$ $(0<p<1)$.
$E[X]=p$, $\operatorname{Var}(X)=E[X^2]-E[X]^2=p(1-p)$.
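As a quick sanity check, these moments can be verified by direct enumeration over the two-point pmf (a minimal sketch in Python; the choice $p=3/10$ is arbitrary):

```python
from fractions import Fraction

# Bernoulli(p): X = 1 with probability p, X = 0 with probability 1-p.
# Verify E[X] = p and Var(X) = E[X^2] - E[X]^2 = p(1-p) exactly.
p = Fraction(3, 10)  # arbitrary illustrative value
pmf = {0: 1 - p, 1: p}

mean = sum(x * prob for x, prob in pmf.items())
second_moment = sum(x**2 * prob for x, prob in pmf.items())
var = second_moment - mean**2

assert mean == p
assert var == p * (1 - p)
```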

If $X_1,\dots,X_n \overset{\text{i.i.d.}}{\sim} \mathrm{Bernoulli}(p)$, we call this a Bernoulli process. Let $S_n=X_1+\dots+X_n$. This leads to:

1.2 Binomial Distribution

$S_n \sim \mathrm{Binomial}(n,p)$, with $P(S_n=k)=\binom{n}{k}p^k(1-p)^{n-k}$.
By linearity of expectation, $E[S_n]=E\left[\sum_{i=1}^n X_i\right]=\sum_{i=1}^n E[X_i]=np$.
By independence of $X_1,\dots,X_n$ and (3.1), $\operatorname{Var}(S_n)=\sum_{i=1}^n \operatorname{Var}(X_i)=np(1-p)$.
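Both formulas can be checked against the pmf itself by summing over the support (a sketch; $n=10$, $p=0.3$ are arbitrary):

```python
import math

# Binomial pmf: P(S_n = k) = C(n, k) p^k (1-p)^(n-k).
# Check E[S_n] = n*p and Var(S_n) = n*p*(1-p) by direct summation.
n, p = 10, 0.3  # arbitrary illustrative values
pmf = [math.comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)]

mean = sum(k * pk for k, pk in enumerate(pmf))
var = sum(k**2 * pk for k, pk in enumerate(pmf)) - mean**2

assert math.isclose(sum(pmf), 1.0)       # pmf sums to 1
assert math.isclose(mean, n * p)         # = 3.0
assert math.isclose(var, n * p * (1 - p))  # = 2.1
```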

1.3 Geometric Distribution

Denote by $W_i$ the waiting time between the $(i-1)$th and $i$th successes. Then $W_i \sim \mathrm{Geometric}(p)$, with $P(W=k)=(1-p)^{k-1}p$, $k \in \mathbb{N}$.
Then $E[W]=\sum_{k=1}^{\infty} k(1-p)^{k-1}p=\frac{1}{p}$, and $\operatorname{Var}(W)=E[W^2]-E[W]^2=\sum_{k=1}^{\infty} k^2(1-p)^{k-1}p-\frac{1}{p^2}=\frac{1-p}{p^2}$.

These moments can be computed more easily using the MGF.
The MGF of $W \sim \mathrm{Geometric}(p)$ is $M_W(t)=\frac{pe^t}{1-(1-p)e^t}$ (for $t<-\ln(1-p)$), so
$E[W]=\frac{d}{dt}M_W(t)\Big|_{t=0}=\frac{pe^t}{(1-(1-p)e^t)^2}\Big|_{t=0}=\frac{1}{p}$, and
$E[W^2]=\frac{d^2}{dt^2}M_W(t)\Big|_{t=0}=\frac{pe^t\left(1+(1-p)e^t\right)}{(1-(1-p)e^t)^3}\Big|_{t=0}=\frac{2-p}{p^2}$,
which recovers $\operatorname{Var}(W)=\frac{2-p}{p^2}-\frac{1}{p^2}=\frac{1-p}{p^2}$.
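Both moments can be sanity-checked numerically by truncating the series (a sketch; $p=0.4$ and the truncation point $K=200$ are arbitrary, and the tail beyond $K$ is negligible):

```python
import math

# Geometric(p) on {1, 2, ...}: P(W = k) = (1-p)^(k-1) p.
# Verify E[W] = 1/p and E[W^2] = (2-p)/p^2 by truncated summation.
p, K = 0.4, 200  # arbitrary illustrative values
ks = range(1, K + 1)
pmf = [(1 - p)**(k - 1) * p for k in ks]

m1 = sum(k * pk for k, pk in zip(ks, pmf))
m2 = sum(k**2 * pk for k, pk in zip(ks, pmf))

assert math.isclose(m1, 1 / p)                 # first moment
assert math.isclose(m2, (2 - p) / p**2)        # second moment
assert math.isclose(m2 - m1**2, (1 - p) / p**2)  # variance
```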

1.4 Negative Binomial Distribution

Let $r \in \mathbb{N}$. Use $T_r$ to denote the total waiting time to the $r$th success. Then $T_r=W_1+\dots+W_r$ with $W_1,\dots,W_r \overset{\text{i.i.d.}}{\sim} \mathrm{Geometric}(p)$, and $P(T_r=n)=\binom{n-1}{r-1}p^{r-1}(1-p)^{n-r}\,p$.
Use $F_r$ to denote the number of failures before the $r$th success; then $F_r+r=T_r$. Write $F_r \sim \mathrm{NegativeBinomial}(r,p)$, with $P(F_r=k)=\binom{r+k-1}{r-1}p^r(1-p)^k=\binom{r+k-1}{k}p^r(1-p)^k$.
Then $E[F_r]=E[T_r]-r=\frac{r}{p}-r=\frac{r(1-p)}{p}$ and $\operatorname{Var}(F_r)=\operatorname{Var}(T_r-r)=\operatorname{Var}(T_r)=r\operatorname{Var}(W_1)=\frac{r(1-p)}{p^2}$.
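These mean and variance formulas can be checked against the pmf of $F_r$ by truncated summation (a sketch; $r=3$, $p=0.5$ are arbitrary):

```python
import math

# Negative binomial (failure count): P(F_r = k) = C(r+k-1, k) p^r (1-p)^k.
# Check E[F_r] = r(1-p)/p and Var(F_r) = r(1-p)/p^2 numerically.
r, p, K = 3, 0.5, 400  # arbitrary illustrative values; tail beyond K is negligible
pmf = [math.comb(r + k - 1, k) * p**r * (1 - p)**k for k in range(K + 1)]

mean = sum(k * pk for k, pk in enumerate(pmf))
var = sum(k**2 * pk for k, pk in enumerate(pmf)) - mean**2

assert math.isclose(sum(pmf), 1.0)
assert math.isclose(mean, r * (1 - p) / p)      # = 3.0
assert math.isclose(var, r * (1 - p) / p**2)    # = 6.0
```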

2 Hypergeometric Distribution

There are $B$ blue balls and $R$ red balls; $N=B+R$. The sample size is $n<N$.

If we sample $n$ balls u.a.r. with replacement, let $X(\omega)$ be the number of blue balls in $\omega \in \Omega$ (the sample space). Then $X \sim \mathrm{Binomial}(n,p)$ with $p=\frac{B}{N}$.

If we sample $n$ balls u.a.r. without replacement, again let $X(\omega)$ be the number of blue balls. Clearly, when $k>B$ or $n-k>R$, $P(X=k)=0$.
For $\max\{0,\,n-R\} \le k \le B$, consider a particular ordered outcome $\omega$ with $X(\omega)=k$. Every such outcome has the same probability $q=P(\{\omega\})=\dfrac{\frac{B!}{(B-k)!}\cdot\frac{R!}{(R-(n-k))!}}{\frac{N!}{(N-n)!}}$.

There are $\binom{n}{k}$ distinct ordered sequences with $k$ blue balls and $n-k$ red balls, so $P(X=k)=\binom{n}{k}q=\dfrac{\binom{B}{k}\binom{N-B}{n-k}}{\binom{N}{n}}$.
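This identity between the ordered-sample computation and the standard binomial-coefficient form can be verified exactly over the whole support (a sketch; $N=20$, $B=8$, $n=6$ are arbitrary):

```python
import math
from fractions import Fraction

# Check C(n,k) * q == C(B,k) C(N-B, n-k) / C(N,n) for every k in the support.
N, B, n = 20, 8, 6  # arbitrary illustrative values
R = N - B

def falling(a, j):
    # falling factorial a! / (a-j)!
    return math.perm(a, j)

for k in range(max(0, n - R), min(n, B) + 1):
    q = Fraction(falling(B, k) * falling(R, n - k), falling(N, n))
    lhs = math.comb(n, k) * q
    rhs = Fraction(math.comb(B, k) * math.comb(N - B, n - k), math.comb(N, n))
    assert lhs == rhs
```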
This distribution is called the Hypergeometric distribution, $X \sim \mathrm{Hypergeometric}(N,B,n)$.

$P(X=k) \to \binom{n}{k}p^k(1-p)^{n-k}$ as $B, N \to \infty$ with $\frac{B}{N} \to p$.
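This binomial limit can be observed numerically by growing $N$ while keeping $B/N$ fixed (a sketch; $p=0.4$, $n=5$, $k=2$, and the sequence of $N$ values are arbitrary):

```python
import math

# Hypergeometric(N, B, n) pmf approaches the Binomial(n, p) pmf as
# N grows with B/N = p held fixed.
def hyper_pmf(N, B, n, k):
    return math.comb(B, k) * math.comb(N - B, n - k) / math.comb(N, n)

p, n, k = 0.4, 5, 2  # arbitrary illustrative values
binom = math.comb(n, k) * p**k * (1 - p)**(n - k)

errors = [abs(hyper_pmf(N, round(p * N), n, k) - binom)
          for N in (50, 500, 5000, 50000)]

assert all(e1 > e2 for e1, e2 in zip(errors, errors[1:]))  # error shrinks with N
assert errors[-1] < 1e-4
```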

2.1 Expectation

Define the indicator RV $I_i(\omega)=\begin{cases}1, & \text{the } i\text{th draw is blue for } \omega \in \Omega,\\ 0, & \text{otherwise.}\end{cases}$
Then $X=I_1+\dots+I_n$, and $(I_1,\dots,I_n)$ is exchangeable. So $P(I_i=1)=P(I_1=1)=\frac{B}{N}=E[I_1]$, hence $E[X]=\sum_{i=1}^n E[I_i]=nE[I_1]=\frac{nB}{N}$.

This is the same as the expectation of $\mathrm{Binomial}(n,\frac{B}{N})$.
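The formula $E[X]=nB/N$ can be confirmed exactly from the pmf itself, without the indicator trick (a sketch using exact rational arithmetic; $N=20$, $B=8$, $n=6$ are arbitrary):

```python
import math
from fractions import Fraction

# E[X] for X ~ Hypergeometric(N, B, n), computed directly from the pmf.
N, B, n = 20, 8, 6  # arbitrary illustrative values
R = N - B

mean = sum(Fraction(k * math.comb(B, k) * math.comb(R, n - k), math.comb(N, n))
           for k in range(max(0, n - R), min(n, B) + 1))

assert mean == Fraction(n * B, N)  # n*B/N = 12/5
```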

2.2 Variance

By exchangeability,

$\operatorname{Var}(X)=\operatorname{Cov}(I_1+\dots+I_n,\,I_1+\dots+I_n)=\sum_{i=1}^n \operatorname{Var}(I_i)+\sum_{i\ne j}\operatorname{Cov}(I_i,I_j)=n\operatorname{Var}(I_1)+n(n-1)\operatorname{Cov}(I_1,I_2).$

Since $\operatorname{Var}(I_1)=\frac{B}{N}\left(1-\frac{B}{N}\right)$ and $\operatorname{Cov}(I_1,I_2)=E[I_1I_2]-E[I_1]E[I_2]$ with $E[I_1I_2]=P(I_1=I_2=1)=\frac{B}{N}\cdot\frac{B-1}{N-1}$,
we get $\operatorname{Cov}(I_1,I_2)=-\frac{B(N-B)}{N^2(N-1)}<0$, and $\operatorname{Var}(X)=n\,\frac{B}{N}\cdot\frac{N-B}{N}\cdot\frac{N-n}{N-1}$.

The factor $n\,\frac{B}{N}\cdot\frac{N-B}{N}$ is the variance of $\mathrm{Binomial}(n,\frac{B}{N})$; the remaining factor $\frac{N-n}{N-1}\le 1$ is the finite-population correction.
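The variance formula, including the correction factor, can also be verified exactly from the pmf (a sketch; $N=20$, $B=8$, $n=6$ are arbitrary):

```python
import math
from fractions import Fraction

# Verify Var(X) = n (B/N)((N-B)/N)((N-n)/(N-1)) for the hypergeometric pmf.
N, B, n = 20, 8, 6  # arbitrary illustrative values
R = N - B
denom = math.comb(N, n)

support = range(max(0, n - R), min(n, B) + 1)
pmf = {k: Fraction(math.comb(B, k) * math.comb(R, n - k), denom) for k in support}

mean = sum(k * pk for k, pk in pmf.items())
var = sum(k**2 * pk for k, pk in pmf.items()) - mean**2

p = Fraction(B, N)
assert var == n * p * (1 - p) * Fraction(N - n, N - 1)
```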

3 Multinomial Distribution

Consider repeated trials, each with $m$ possible types of outcome. (For Bernoulli trials, $m=2$; for rolling a die, $m=6$.)
Denote by $N$ the number of trials and by $X_i$ the number of times type $i$ is observed.
Write $(X_1,\dots,X_m)=\mathbf{X}$ ($X_1,\dots,X_m$ are not independent given $N=n$); then $(X_1,\dots,X_m)\mid N=n \sim \mathrm{Multinomial}(n,p_1,\dots,p_m)$, with $p_j \in [0,1]$ and $\sum_{j=1}^m p_j = 1$.
A possible outcome $a=(a_1,\dots,a_m)$ must satisfy $\sum_{j=1}^m a_j=n$ and $a_j \in \{0,\dots,n\}$. So $P(\mathbf{X}=a \mid N=n)=\binom{n}{a_1,\dots,a_m}p_1^{a_1}\cdots p_m^{a_m}$, where $\binom{n}{a_1,\dots,a_m}=\frac{n!}{a_1!\cdots a_m!}$.
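As a sanity check, the multinomial pmf sums to 1 over all outcomes with $a_1+\dots+a_m=n$ (a sketch; $m=3$, $n=4$, and the probability vector are arbitrary):

```python
import math
from itertools import product

# Multinomial pmf: P(X = a) = n!/(a1!...am!) * p1^a1 ... pm^am.
n, probs = 4, (0.2, 0.3, 0.5)  # arbitrary illustrative values

def multinomial_pmf(a, probs):
    coef = math.factorial(sum(a))
    for ai in a:
        coef //= math.factorial(ai)
    return coef * math.prod(p**ai for p, ai in zip(probs, a))

# Enumerate all tuples a with a1 + ... + am = n and sum the pmf.
total = sum(multinomial_pmf(a, probs)
            for a in product(range(n + 1), repeat=len(probs))
            if sum(a) == n)

assert math.isclose(total, 1.0)
```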


3.1 Poissonization of the Multinomial
Now suppose the number of trials is a RV $N \sim \mathrm{Poisson}(\lambda)$. We want to determine the distribution of $(X_1,\dots,X_m)$:
$$P(\mathbf{X}=a)=\sum_{n=0}^{\infty}P(\mathbf{X}=a \mid N=n)P(N=n)=\sum_{n=0}^{\infty}\frac{n!}{a_1!\cdots a_m!}p_1^{a_1}\cdots p_m^{a_m}\,\mathbf{1}\!\left\{\textstyle\sum_{j=1}^m a_j=n\right\}\frac{\lambda^n}{n!}e^{-\lambda}=\frac{p_1^{a_1}\cdots p_m^{a_m}}{a_1!\cdots a_m!}\lambda^{a_1+\dots+a_m}e^{-\lambda(p_1+\dots+p_m)}=\left[\frac{(p_1\lambda)^{a_1}}{a_1!}e^{-p_1\lambda}\right]\cdots\left[\frac{(p_m\lambda)^{a_m}}{a_m!}e^{-p_m\lambda}\right].$$
So if $N \sim \mathrm{Poisson}(\lambda)$,

  1. $X_1,\dots,X_m$ are independent, and
  2. $X_j \sim \mathrm{Poisson}(p_j\lambda)$,

even though $\mathbf{X} \mid N=n \sim \mathrm{Multinomial}(n,p_1,\dots,p_m)$.
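The collapse of the sum above to a single term (only $n=a_1+\dots+a_m$ survives the indicator) can be checked numerically; $\lambda=2.5$, the $p_j$, and the outcome $a=(1,0,3)$ below are arbitrary:

```python
import math

# Poissonization check: with N ~ Poisson(lam),
# sum_n P(X=a | N=n) P(N=n) should equal prod_j Poisson(p_j*lam) pmf at a_j.
lam, probs, a = 2.5, (0.2, 0.3, 0.5), (1, 0, 3)  # arbitrary illustrative values
n = sum(a)

def poisson_pmf(k, mu):
    return mu**k * math.exp(-mu) / math.factorial(k)

# Only the n = a1 + ... + am term survives the indicator in the sum.
coef = math.factorial(n)
for ai in a:
    coef //= math.factorial(ai)
lhs = coef * math.prod(p**ai for p, ai in zip(probs, a)) * poisson_pmf(n, lam)

# Product of independent Poisson(p_j * lam) pmfs.
rhs = math.prod(poisson_pmf(ai, p * lam) for ai, p in zip(a, probs))

assert math.isclose(lhs, rhs)
```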